Abstract

Realistic quantum mechanics based on complex probability theory is shown 
to have a frequency interpretation, to coexist with Bell's theorem, to be linear, 
to include wavefunctions which are expansions in eigenfunctions of Hermitian 
operators and to describe both pure and mixed systems. Illustrative examples 
are given. The quantum version of Bayesian inference is discussed. 



1. Introduction 

Quantum mechanics can be reformulated as a realistic theory where quantum effects
arise from a breakdown of conventional probability theory rather than from the usual
wave-particle duality. Consider, for example, the two slit experiment where a source emits a
single particle which travels towards a wall with two slits and is detected at position $x$ on a
screen located behind the wall. The usual argument concluding that an interference pattern on
the screen implies that the particle did not go through either one slit or the other is
ultimately an argument in probability theory where

$$P(x) = P(x \text{ via slit 1}) + P(x \text{ via slit 2}) \tag{1}$$

is the critical statement. As an alternative, it is possible to consider a realistic theory, where
the particle does go through one slit or the other, but where (1) fails due to a failure of
probability theory itself. The basic consequences of this idea and its consistency with Bell's
theorem and with other limitations on local realism are discussed in references 1 and 2. Here
we develop this approach in more detail with emphasis on insights which are not available
in standard quantum mechanics.

2. Probability Theory as Physics

Probability theory is often introduced as a theory of experiments which may succeed
or may fail due to some random influence. If an experiment is successful $n$ times in $N$
trials, one calls the large $N$ limit of $n/N$ the "probability of success." Probability theory
then results from assuming that these probabilities follow Kolmogorov's axioms. For our
purposes, however, it is convenient to adopt the more general Bayesian view where, given
a pair of propositions $(a, b)$, probability $P(a, b)$ gives a non-negative real number meant to
measure how likely it is that $b$ is true if $a$ is known. One can then follow Cox and assume
that $P(a, b \wedge c) = F(P(a, b), P(a \wedge b, c))$ and $P(a, \neg b) = G(P(a, b))$ for some fixed
functions $F$ and $G$ and derive the Bayesian form of probability theory. A function $P$ mapping
pairs of propositions into the interval [0,1] is a probability if

$$\text{(a)}\quad P(a, b \wedge c) = P(a, b)\,P(a \wedge b, c), \tag{2}$$

$$\text{(b)}\quad P(a, b) + P(a, \neg b) = 1, \tag{3}$$

$$\text{(c)}\quad P(a, \neg a) = 0 \tag{4}$$

for all propositions $a$, $b$ and $c$. In its Bayesian form, probability theory is just a consistent
way of assigning a "likelihood" in the interval [0,1] to any pair of propositions whether or
not these propositions refer to the outcome of a random experiment. As we will see, the
Bayesian view is essential for constructing and understanding the predictions of our extended
probability theory.

To derive a frequency interpretation for ordinary probabilities, let $p$ be the probability of
success in an experiment and note that by the central limit theorem, the number of successes
$n$ in $N$ independent copies of the experiment is asymptotically Gaussian with mean $\mu = Np$
and with $\sigma/\mu$ proportional to $1/\sqrt{N}$. The probability for $n/N$ to be in any interval
not containing $p$ can then be made arbitrarily small by increasing $N$. Thus, we have a frequency
interpretation provided that we assume that an arbitrarily small probability for $n/N$ to be
in some interval means that $n/N$ is never actually observed to be in that interval, or, more
generally, that $P(a, b) = 0$ means that $b$ is never observed if $a$ is known to be true. As
innocent as it seems, this additional property brings probability theory into the domain
of physics and leads to a question: does such a $P$ exist? If we assume that there is no
conventional probability $P$ with this additional property, Cox's remaining axioms may still
be preserved provided probabilities are allowed to be complex. Since Bayesian probabilities
are not defined as frequencies, it is possible to consider complex probabilities provided a
consistent frequency interpretation can be constructed after the fact (section 3). In the
complex case, Cox's arguments follow as before and the resulting complex probability theory
has exactly the same form as (2)-(4), only with a complex $P$. In order to distinguish complex
probabilities from the conventional case, an arrow notation is used so that the complex
probability that $b$ is true given that $a$ is known is denoted by "$(a \to b)$" rather than by
"$P(a, b)$." The arrow is meant to suggest a numerical version of implication.

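The frequency argument for ordinary probabilities can be checked numerically. The sketch below is only an illustration, not part of the original derivation: it evaluates the exact binomial probability that $n/N$ lies further than a tolerance from $p$, for hypothetical values `p = 0.3` and `eps = 0.125` chosen for this example, and shows that probability shrinking as $N$ grows.

```python
from math import comb

def prob_outside(N, p, eps):
    """Exact binomial probability that n/N differs from p by more than eps."""
    total = 0.0
    for n in range(N + 1):
        if abs(n / N - p) > eps:
            total += comb(N, n) * p**n * (1 - p) ** (N - n)
    return total

# Illustrative values only (assumptions of this sketch).
p, eps = 0.3, 0.125
trial_counts = (10, 100, 1000)
tails = [prob_outside(N, p, eps) for N in trial_counts]

for N, tail in zip(trial_counts, tails):
    print(f"N={N:5d}  P(|n/N - p| > {eps}) = {tail:.3e}")
```

The printed tail probabilities fall rapidly with $N$; the frequency interpretation then rests on the extra physical assumption that sufficiently improbable outcomes are never observed.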
We are now ready to attempt a description of quantum phenomena by combining
complex probability with a simple definition of realism. An arrow function mapping pairs of
propositions into the complex numbers is a quantum theory if

I. The arrow is a complex probability:

$$(a \to b \wedge c) = (a \to b)(a \wedge b \to c) \tag{I.a}$$

$$(a \to b) + (a \to \neg b) = 1 \tag{I.b}$$

$$(a \to \neg a) = 0 \tag{I.c}$$

for all propositions $a$, $b$ and $c$ and if

II. There exists a set $U$ (the state space) and a real time $t$ such that:

$$x_t \wedge y_t = \text{false if } x \neq y \tag{II.a}$$

$$(a_t \to b_{t''}) = (a_t \to U_{t'} \wedge b_{t''}) \tag{II.b}$$

$$(a_t \wedge x_{t'} \to b_{t''}) = (x_{t'} \to b_{t''}) \tag{II.c}$$

for all $x, y \in U$, for all propositions $a$, $b$ and for all times $t \le t' \le t''$ with
$U_t = \bigvee_{x \in U} x_t$.

The realistic state axioms (II.a-II.c) guarantee that a system cannot be in two states
at once (II.a), that a system is in some state at each intermediate time (II.b) and that the
knowledge that a system is in some particular state makes all previous knowledge irrelevant
(II.c). For the purposes of this paper, it is convenient to add the following technical
assumptions:

III. $U$ is a measure space and, for any times $t \le t'$ and for any proposition $a_t$,
$(a_t \to U_{t'}) = \int_{x \in U} (a_t \to x_{t'})$ and $(a_t \to U_{t'})$ is differentiable with
respect to $t'$.

Note that unlike the "state space" of conventional quantum mechanics, $U$ is not a Hilbert space.

Many simple facts from probability theory also follow in quantum theories. For all
propositions $a$, $b$ and $c$, $(a \to a) = 1$, $(a \to \text{true}) = 1$, $(a \to \text{false}) = 0$ and
$(a \to b \vee c) = (a \to b) + (a \to c) - (a \to b \wedge c)$ and, if $(a \to b) \neq 0$, then
$(a \wedge b \to c) = (a \wedge b \to b \wedge c)$.
It is convenient to call a proposition $e_t$ normal if $\int_{x \in U} |e_t \to x_{t'}|^2$ exists and is
greater than zero for all $t' \ge t$ and to let a time subscript on a set of propositions denote the
or of all its elements: $W_t = \bigvee_{w \in W} w_t$. Following probability theory, propositions $a$
and $b$ are said to be independent if $(q \wedge a \to b) = (q \to b)$ for all propositions $q$. The rest
of this work is an exploration of the consequences of I-III.
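The complex probability axioms and the simple facts listed above can be exercised on a small toy model. The sketch below is an assumption made purely for illustration and is not a construction from the paper: propositions are subsets of four hypothetical "worlds" carrying complex weights, and the arrow is taken to be a ratio of summed weights. Under that assumption the product rule, the sum rule and the inclusion-exclusion identity can be verified by brute force.

```python
from itertools import combinations

# Hypothetical toy model: four "worlds" with complex weights (illustrative values).
WEIGHTS = {0: 1 + 0j, 1: 0.5j, 2: -0.25 + 0.5j, 3: 0.75 - 0.25j}
WORLDS = frozenset(WEIGHTS)

def weight(prop):
    """Summed complex weight of a proposition (a set of worlds)."""
    return sum(WEIGHTS[w] for w in prop)

def arrow(a, b):
    """Toy (a -> b): ratio of complex weights; requires weight(a) != 0."""
    return weight(a & b) / weight(a)

def neg(a):
    return WORLDS - a

def all_props():
    items = sorted(WORLDS)
    for r in range(len(items) + 1):
        for combo in combinations(items, r):
            yield frozenset(combo)

def check_axioms(tol=1e-12):
    props = list(all_props())
    for a in props:
        if abs(weight(a)) < tol:
            continue  # the arrow is undefined for zero-weight knowledge
        assert abs(arrow(a, neg(a))) < tol          # (a -> not a) = 0
        assert abs(arrow(a, a) - 1) < tol           # (a -> a) = 1
        for b in props:
            assert abs(arrow(a, b) + arrow(a, neg(b)) - 1) < tol   # sum rule
            for c in props:
                lhs = arrow(a, b | c)
                rhs = arrow(a, b) + arrow(a, c) - arrow(a, b & c)
                assert abs(lhs - rhs) < tol         # inclusion-exclusion
                if abs(weight(a & b)) > tol:
                    prod = arrow(a, b) * arrow(a & b, c)
                    assert abs(arrow(a, b & c) - prod) < tol       # product rule
    return True

print(check_axioms())
```

This toy model only illustrates axiom I; it has no state space or time and so says nothing about axioms II and III.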



3. Reconstructing a Frequency Interpretation 

As discussed above, the physical meaning of conventional probability is fixed by the
assumption that if proposition $a$ is known and if $P(a, b) = 0$, then $b$ never happens in real
experiments. Since, however, complex probabilities may sum to zero without being zero
individually, we make this assumption for $(a \to b)$ only if $b = x_t$ for some $x$ in the state
space. More generally, if $U$ is a quantum theory, $t \le t'$, $e_t$ is a normal proposition and $a$ and
$b$ are any propositions, define a function Prob by

$$\mathrm{Prob}(e_t, a_{t'}) = \frac{\int_{x \in U} |e_t \to a_{t'} \wedge x_{t'}|^2}{\int_{x \in U} |e_t \to x_{t'}|^2} \tag{5}$$

and

$$\mathrm{Prob}(e_t, a_{t'} \wedge b_{t'}) = \frac{\int_{x \in U} |e_t \to a_{t'} \wedge b_{t'} \wedge x_{t'}|^2}{\int_{x \in U} |e_t \to x_{t'}|^2} \tag{6}$$

where, under conditions described below, Prob will be shown to have many of the properties of
a conventional probability and, in particular, to provide a frequency interpretation for quantum
theories.

Supposing that $U$ is a quantum theory and $W$ is a set of propositions, we say that $W$
supports probabilities with initial knowledge $e_t$ if $e_t$ is normal and if, for all $t' \ge t$ and for
all subsets $A$ and $B$ of $W$,

$$\text{(a)}\quad A_{t'} \wedge \bar{A}_{t'} = \text{false and } U_{t'} \text{ implies } W_{t'} \tag{7}$$

$$\text{(b)}\quad \mathrm{Prob}(e_t, A_{t'} \wedge B_{t'}) = \mathrm{Prob}(e_t, A_{t'})\,\mathrm{Prob}(e_t \wedge A_{t'}, B_{t'}) \tag{8}$$

$$\text{(c)}\quad \mathrm{Prob}(e_t, A_{t'}) + \mathrm{Prob}(e_t, \bar{A}_{t'}) = 1 \tag{9}$$

where the complement $\bar{A} = W - A$. Following the standard argument sketched in
the previous section, if Prob has these properties for a set of propositions $W$, then the
probabilities $\mathrm{Prob}(e_t, A_{t'})$ have a frequency interpretation. The conditions under which $W$
supports probabilities can be easily found.

Theorem 1 (Orthogonality). If $U$ is a quantum theory, $e_t$ is a normal proposition and
$W$ is a set of propositions satisfying point (a) above, then $W$ supports probabilities with
initial knowledge $e_t$ if and only if, for all times $t' \ge t$ and for all subsets $A$ of $W$,

$$\int_{x \in U} (e_t \to A_{t'} \wedge x_{t'})^* (e_t \to \bar{A}_{t'} \wedge x_{t'}) + (e_t \to A_{t'} \wedge x_{t'}) (e_t \to \bar{A}_{t'} \wedge x_{t'})^* = 0.$$

Proof. Assume first that the integral vanishes; we need to show that $W$ supports probabilities
with initial knowledge $e_t$. Condition (a) above is already assumed, (b) follows easily from the
definition of Prob, and (c) follows since
$(e_t \to x_{t'}) = (e_t \to W_{t'} \wedge x_{t'}) = (e_t \to A_{t'} \wedge x_{t'}) + (e_t \to \bar{A}_{t'} \wedge x_{t'})$
and therefore

$$1 = \frac{\int_{x \in U} |e_t \to x_{t'}|^2}{\int_{x \in U} |e_t \to x_{t'}|^2} = \mathrm{Prob}(e_t, A_{t'}) + \mathrm{Prob}(e_t, \bar{A}_{t'}) + \frac{I(A)}{\int_{x \in U} |e_t \to x_{t'}|^2}$$

where $I(A)$ is the integral in the theorem. Similarly for the reverse implication.

Since the state space $U$ of a quantum theory trivially satisfies the conditions of theorem 1, $U$
supports probabilities with any normal initial knowledge, and given any $A \subset U$,
$\mathrm{Prob}(e_t, A_{t'})$ has a proper frequency interpretation and thus predicts how often a system
will be found in region $A$ at time $t'$ given that $e$ is known at time $t$.
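The role of the cross term in theorem 1 can be seen on a finite state space, where integrals over $U$ become sums. The sketch below is a hedged illustration with made-up amplitudes, not data from the paper: `amp_A[x]` stands for $(e_t \to A_{t'} \wedge x_{t'})$ and `amp_notA[x]` for the same arrow with the complement of $A$. It checks that the two Prob values plus the normalized cross term always sum to one, and that the cross term vanishes when the two sets of amplitudes never overlap on the same state.

```python
def prob_parts(amp_A, amp_notA):
    """Return (Prob(A), Prob(not A), cross term / Z) for amplitudes listed per state x."""
    total = [u + v for u, v in zip(amp_A, amp_notA)]        # (e_t -> x_t')
    Z = sum(abs(z) ** 2 for z in total)
    prob_A = sum(abs(u) ** 2 for u in amp_A) / Z
    prob_notA = sum(abs(v) ** 2 for v in amp_notA) / Z
    cross = sum((u.conjugate() * v + u * v.conjugate()).real
                for u, v in zip(amp_A, amp_notA))
    return prob_A, prob_notA, cross / Z

# Hypothetical amplitudes on a 3-point state space (illustrative values only).
overlapping = prob_parts([0.5 + 0.5j, 0j, 0.25j], [0j, 0.5 - 0.25j, -0.25j])
disjoint = prob_parts([0.5 + 0.5j, 0j, 0.25j], [0j, 0.5 - 0.25j, 0j])

print("overlapping:", overlapping, "sum =", sum(overlapping))
print("disjoint:   ", disjoint, "sum =", sum(disjoint))
```

In the disjoint case, which mimics subsets $A \subset U$ of the state space itself, the cross term is zero and the two Prob values alone sum to one.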

It is useful to consider extending the definition of Prob to mixed times. If, for normal $e_t$
with $t \le t' \le t''$ we were to define

$$\mathrm{Prob}(e_t, A_{t'} \wedge B_{t''}) = \frac{\int_{x \in U} |e_t \to A_{t'} \wedge B_{t''} \wedge x_{t''}|^2}{\int_{x \in U} |e_t \to x_{t''}|^2} \tag{10}$$

then

$$\mathrm{Prob}(e_t, A_{t'} \wedge B_{t''}) = \frac{\int_{x \in U} |e_t \to A_{t'} \wedge x_{t''}|^2}{\int_{x \in U} |e_t \to x_{t''}|^2}\,\mathrm{Prob}(e_t \wedge A_{t'}, B_{t''}) \tag{11}$$

which does not generally satisfy (2) unless $t' = t''$. In particular,
$\int_{x \in U} |e_t \to A_{t'} \wedge x_{t''}|^2 = \int_{x \in U} \left| \int_{y \in U} (e_t \to A_{t'} \wedge y_{t'})(y_{t'} \to x_{t''}) \right|^2$
and, if one could take the square inside the integral
(and assuming $\mathrm{Prob}(y_{t'}, U_{t''}) = 1$), (2) would be satisfied. Since this fails because of the
"interference terms" involved in exchanging the square with the integral, this suggests that
an appropriate classical limit would restore probability theory (2)-(4) for propositions with
mixed times. In addition, this failure demonstrates the reason that Bell's theorem does
not rule out quantum theories in spite of the fact that they are realistic and local. In
Bell's analysis, two spin $\frac{1}{2}$ particles in a singlet state are emitted towards two distant
Stern-Gerlach magnets. Let $e_t$ define the known orientations of the two magnets and the
description of the initial singlet state and let $M_{t''}$ be a description of one of the possible
results of the final measurements. Let $t$ be the time when the singlet state is released, $t''$
be the time of the final measurement and let $t < t' < t''$. Bell's argument is an argument
in probability theory beginning with an expansion in "hidden variable" $\lambda$ in state space $U$:
$P(e_t, M_{t''}) = P(e_t, U_{t'} \wedge M_{t''})$ and so

$$P(e_t, M_{t''}) = \int_{\lambda \in U} P(e_t, \lambda_{t'} \wedge M_{t''}) = \int_{\lambda \in U} P(e_t, \lambda_{t'})\,P(e_t \wedge \lambda_{t'}, M_{t''}) \tag{12}$$

which, from our point of view, fails in the last step since $t' \neq t''$. Thus, although Bell's
theorem is usually interpreted as ruling out local realistic theories, in a more general context
Bell's result actually shows that one must choose between local realism and conventional
probability theory. Discussion of other limitations on local realistic theories can be found in
reference 2.
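The "interference terms" that block the exchange of the square with the integral can be made concrete on a two-point state space. The numbers below are hypothetical and chosen only for illustration: `first_step[y]` plays the role of $(e_t \to y_{t'})$ and `second_step[y][x]` the role of $(y_{t'} \to x_{t''})$. The sketch compares squaring after the sum over the intermediate state with squaring before it.

```python
import math

# Hypothetical amplitudes (illustrative assumption, not from the paper).
s = 1 / math.sqrt(2)
first_step = [s, s]                  # (e_t -> y_t') for y = 0, 1
second_step = [[s, s], [s, -s]]      # (y_t' -> x_t'') for y (row), x (column)

def coherent(x):
    """|sum_y (e -> y)(y -> x)|^2 : square taken outside the sum over y."""
    return abs(sum(first_step[y] * second_step[y][x] for y in range(2))) ** 2

def incoherent(x):
    """sum_y |(e -> y)|^2 |(y -> x)|^2 : square taken inside the sum over y."""
    return sum(abs(first_step[y]) ** 2 * abs(second_step[y][x]) ** 2 for y in range(2))

for x in range(2):
    print(x, round(coherent(x), 12), round(incoherent(x), 12))
```

The two expressions agree only in total, not state by state; the difference for each final state is exactly the interference contribution that a conventional product rule at mixed times would ignore.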

4. Examples of Quantum Theories 

In order to provide some examples, it is helpful to begin the process of classifying quantum
theories by their global properties. For example, one expects many quantum theories to be
time invariant in the sense that for any propositions $a$ and $b$,
$(a_t \to b_{t'}) = (a_{t+\tau} \to b_{t'+\tau})$ for all times $t$, $t'$, $\tau$. If such a quantum theory
is also conservative in the sense that $U_{t'}$ implies $U_t$ for all $t \le t'$, then we have

Theorem 2. If a quantum theory $U$ is conservative and time invariant, then there exists a
function $\phi : U \to C$ and a complex number $\lambda$ such that
$(U_t \to x_{t'}) = e^{\lambda (t' - t)} \phi(x)$ for all $x \in U$ and for all times $t \le t'$.

Proof. Let $t \le t' \le t''$ and $f(\tau) = (U_t \to U_{t+\tau})$. Then using the axioms and the
conservative property, $(U_t \to U_{t''}) = (U_t \to U_{t'} \wedge U_{t''}) = (U_t \to U_{t'})(U_{t'} \to U_{t''})$
and so $f(a + b) = f(a) f(b)$ for all real $a, b \ge 0$. Since $f$ is differentiable (by III),
$f(a) = e^{\lambda a}$ for some complex $\lambda$ and
$(U_t \to x_{t'}) = (U_t \to U_{t'} \wedge x_{t'}) = (U_t \to U_{t'})(U_{t'} \to x_{t'})$ and the theorem
is proved if we let $\phi(x) = (U_{t'} \to x_{t'})$ which is time independent by the time invariance
property.
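The step from the functional equation to the exponential in the proof of theorem 2 is standard; a short worked version, assuming that the one-sided derivative of $f$ at zero exists as suggested by III, is sketched here.

```latex
\begin{align*}
f(0) &= (U_t \to U_t) = 1, \\
f'(a) &= \lim_{h \to 0^+} \frac{f(a+h) - f(a)}{h}
       = f(a) \lim_{h \to 0^+} \frac{f(h) - 1}{h}
       = \lambda f(a), \qquad \lambda \equiv f'(0), \\
f(a) &= e^{\lambda a}.
\end{align*}
```

The first line uses $(a \to a) = 1$, the second uses $f(a+h) = f(a)f(h)$, and the third is the unique solution of $f' = \lambda f$ with $f(0) = 1$.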

In the conventional view of quantum mechanics, theorem 2 is puzzling. After assuming very
little about the system, one has concluded that the wavefunction is an energy eigenstate
$\Psi(x, t) = e^{\lambda t} \phi(x)$. What if the system is, in fact, not in an energy eigenstate but is
instead in a superposition of two different energy eigenstates? To answer this question, recall that
a complex probability $(a_0 \to b_t)$ is the best estimate of the truth of $b_t$ given that $a_0$ is
known. However, if some additional fact $X$ is also known, one should instead calculate
$(a_0 \wedge X \to b_t)$. Thus, a wavefunction such as $\Psi$ should not be mistaken for "the state of the
system." $\Psi$ merely represents the best estimate of the complex probabilities for $x_t$ ($x \in U$)
given that only $U_0$ is known at $t = 0$. Since different observers may have different knowledge
about a system, they may also describe a single system with different wavefunctions. This
also implies that if an observer does not know all the relevant facts about a system, the
corresponding wavefunction may give incorrect predictions. This, however, is not a failure of
quantum theory any more than it is a failure of probability theory when the usual analysis
of throwing a die fails in the case of a loaded die. In both cases, the theories are successful
only if the relevant facts are known. To answer the original question, it is not possible for
a superposition of two energy eigenstates to be the optimal description of the system unless
some additional fact $X$ is known in addition to $U_0$. To take another example, consider a
particle confined in a box containing some known but time invariant potential and where one
initially knows only that the particle is somewhere in the left half of the box. One expects
that, typically, this initial information will become less useful as time goes on until
finally, the initial knowledge has no influence on the optimal wavefunction. This suggests
that we call a quantum theory forgetful if, for all propositions $a$, $b$, $c$ and for all times $t$,
$(a_t \wedge b_{t'} \to c_{t''}) = (b_{t'} \to c_{t''})$ in the limit $t', t'' \to \infty$. As might be expected,
one can easily show that forgetful, conservative, time invariant quantum theories evolve into
energy eigenstates independent of the initial knowledge of the system.

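For an example with more detailed predictions, consider a conservative, time invariant
quantum theory $U$ with a trap $\beta \subset U$ in the sense that $\beta_t \Rightarrow \beta_{t'}$ for all
$t \le t'$. Since $U$ is conservative, $\alpha_{t'} \Rightarrow \alpha_t$ where $\alpha = U - \beta$ [since
$\alpha_{t'} \Rightarrow U_{t'} \Rightarrow U_t = (\alpha_t \vee \beta_t)$]. By arguments
similar to the proof of theorem 2, one can show that for some complex $\lambda$, $k_0$,
$(\alpha_0 \to \alpha_t) = e^{\lambda t}$ and $(\alpha_0 \to \beta_t) = k_0 (1 - e^{\lambda t})$. Given that the
system is not initially trapped, the probability to remain outside the trap for an additional time
$t$ is

$$\mathrm{Prob}(\alpha_0, \alpha_t) = \frac{\int_{x \in \alpha} |\alpha_0 \to x_t|^2}{\int_{x \in U} |\alpha_0 \to x_t|^2} = \frac{\int_{x \in \alpha} |\alpha_0 \to x_t|^2}{\int_{x \in \alpha} |\alpha_0 \to x_t|^2 + \int_{x \in \beta} |\alpha_0 \to x_t|^2} \tag{13}$$

and since
$\int_{x \in \alpha} |\alpha_0 \to x_t|^2 = \int_{x \in \alpha} |\alpha_0 \to \alpha_t \wedge x_t|^2 = |e^{\lambda t}|^2 \int_{x \in \alpha} |\alpha_t \to x_t|^2$
and similarly
$\int_{x \in \beta} |\alpha_0 \to x_t|^2 = \int_{x \in \beta} |\alpha_0 \to \beta_t \wedge x_t|^2 = |k_0 (1 - e^{\lambda t})|^2 \int_{x \in \beta} |\alpha_0 \wedge \beta_t \to x_t|^2$
and so

$$\mathrm{Prob}(\alpha_0, \alpha_t) = \frac{|e^{\lambda t}|^2}{|e^{\lambda t}|^2 + k(t)\,|1 - e^{\lambda t}|^2} \tag{14}$$

where $k(t) = |k_0|^2 \int_{x \in \beta} |\alpha_0 \wedge \beta_t \to x_t|^2 / \int_{x \in \alpha} |\alpha_t \to x_t|^2$. If, in
addition, $U$ is forgetful, then $k(t)$ is time independent in the large $t$ limit and, assuming that
$\lambda$ is real and negative, results in the usual exponential decay law. If $\beta$ is a single point in
the state space then, using II.c, $k(t)$ is time independent for all $t$ and, if $\lambda$ is real and
negative, $\mathrm{Prob}(\alpha_0, \alpha_t)$ has the expected large $t$ behavior.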
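The large $t$ behavior of the trap example can be explored numerically. The sketch below is an illustration under stated assumptions rather than the paper's own calculation: it takes the survival probability to be $|e^{\lambda t}|^2 / (|e^{\lambda t}|^2 + k|1 - e^{\lambda t}|^2)$, which is what the amplitudes $(\alpha_0 \to \alpha_t) = e^{\lambda t}$ and $(\alpha_0 \to \beta_t) = k_0(1 - e^{\lambda t})$ give when the ratio $k$ is treated as a constant, and the values of `lam` and `k` are hypothetical.

```python
import cmath
import math

def survival_probability(t, lam, k):
    """Probability to remain outside the trap at time t, with constant k assumed."""
    decay = cmath.exp(lam * t)
    stay = abs(decay) ** 2
    trapped = k * abs(1 - decay) ** 2
    return stay / (stay + trapped)

def exponential_estimate(t, lam, k):
    """Large-t estimate exp(2 * lam * t) / k, valid for real negative lam."""
    return math.exp(2 * lam * t) / k

# Hypothetical parameters for illustration.
lam, k = -0.5, 2.0
for t in (0.0, 1.0, 5.0, 20.0):
    print(t, survival_probability(t, lam, k), exponential_estimate(t, lam, k))
```

At $t = 0$ the probability is one, and for large $t$ it approaches the simple exponential estimate, which is the usual decay law.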

As an even more detailed example, one can extract the complete dynamics of a scalar
particle in a time invariant quantum theory with a state space $U = R^d$. Since, for any
initial proposition $e_0$, the wavefunction $(e_0 \to x_t)$ is given by
$(e_0 \to x_t) = (e_0 \to U_0 \wedge x_t) = \int_{y \in U} (e_0 \to y_0)(y_0 \to x_t)$, the time
development of any initial wavefunction is determined
by the "propagator" $(x_t \to x'_{t'})$. One can now construct a path integral by choosing times
$t_0 \le t_1 \le t_2 \le \ldots \le t_n$ with $t_0 = t$ and $t_n = t'$. Letting $x_j$ denote "$x_j \in U$ is
true at time $t_j$," with $x_0 = x$ and $x_n = x'$, repeated application of II.b gives

$$(x_0 \to x_n) = \int_{x_1 \in U} \cdots \int_{x_{n-1} \in U} (x_0 \to x_1)(x_1 \to x_2) \times \ldots \times (x_{n-1} \to x_n). \tag{15}$$
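On a finite state space the path integral reduces to an ordinary sum over intermediate states. The sketch below is a toy illustration with a hypothetical one-step kernel `K` (its values are not from the paper): it sums the product of one-step arrows over every path explicitly and compares the result with repeated matrix multiplication, which is the same composition written compactly.

```python
from itertools import product

# Hypothetical one-step kernel on a 2-point state space: K[y][x] = (y_j -> x_{j+1}).
K = [[0.6 + 0.2j, 0.3 - 0.1j],
     [0.1 + 0.4j, 0.5 + 0.0j]]
STATES = range(len(K))

def path_sum(x0, xn, n):
    """Sum over all intermediate states x_1..x_{n-1} of the product of n one-step arrows."""
    total = 0j
    for mids in product(STATES, repeat=n - 1):
        path = (x0, *mids, xn)
        amp = 1 + 0j
        for a, b in zip(path, path[1:]):
            amp *= K[a][b]
        total += amp
    return total

def mat_mul(A, B):
    return [[sum(A[i][k] * B[k][j] for k in STATES) for j in STATES] for i in STATES]

def mat_pow(A, n):
    """n-fold product of A with itself (n >= 1)."""
    result = A
    for _ in range(n - 1):
        result = mat_mul(result, A)
    return result

four_step = mat_pow(K, 4)
print(path_sum(0, 1, 4), four_step[0][1])
```

Using the same kernel for every step corresponds to a time invariant theory with equally spaced times; a different kernel per step would work the same way.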



As shown in reference 1, by repeating the same argument, each interval $(x_j \to x_{j+1})$ can
be expanded into a sub-path integral which can then be reduced to a convolution by taking a
limit of the times $t_j$ and inverted with a Fourier transform. As a result, for small $t' - t$,
$|x' - x|$, the propagator $(x_t \to x'_{t'})$ is

$$(x_t \to (x + z)_{t+\tau}) = \frac{1}{(2\pi\tau)^{d/2} \sqrt{\det[\nu]}} \exp\left( -\frac{\tau}{2} \left( \frac{z_j}{\tau} - \nu_j \right) W_{jk} \left( \frac{z_k}{\tau} - \nu_k \right) + \tau \nu_0 \right) \tag{16}$$

where $\nu_0(x)$, $\nu_j(x)$, $\nu_{jk}(x)$ and $W_{jk}(x)$ are moments of
$\mu(x, z, \tau) = (x_t \to (x + z)_{t+\tau})$ defined by
$\nu_0(x) = \int_{z \in U} \mu_\tau(x, z, 0)$, $\nu_j(x) = \int_{z \in U} \mu_\tau(x, z, 0) z_j$, and
$\nu_{jk}(x) = \int_{z \in U} \mu_\tau(x, z, 0) z_j z_k$ (with $\mu_\tau$ denoting $\partial\mu/\partial\tau$),
with $W_{jk} = M_{ji} M_{ki} w_i$ where $M$ is the matrix which diagonalizes $\nu_{jk}$ such that
$M_{ji} \nu_{jk} M_{kl} = \delta_{il} / w_l$. With velocity $v_j$ given by the limit of $z_j / \tau$, the
above propagator is equivalent to the Lagrangian

$$L(x, v) = \frac{i}{2} (v_j - \nu_j) W_{jk} (v_k - \nu_k) - i \nu_0 \tag{17}$$

which, by renaming the various moments, produces the Schrödinger equation for $d = 3$
or the Klein-Gordon equation for $d = 4$ (with $t$ identified as the proper time) where the
particle mass, static vector and scalar potentials and metric ($W_{jk}$) all appear as moments
of $(x_t \to x'_{t'})$. Notice that in contrast to the usual procedure, we have not assumed that the
action is given by a classical Lagrangian or that the theory is Lorentz or gauge invariant.

Quantum theories also provide a convenient way to incorporate assumptions about an
experiment directly into complex probabilities even if a solution to the full dynamics is
not available. Consider, for example, the simple interferometer depicted in figure 1 where
a photon encounters a beam splitter ($S_1$) followed by a mirror ($M_1$ or $M_2$) and a second
beam splitter ($S_2$) thus reaching detector $D_1$ or $D_2$ via path $P_1$ and $Q_1$ or via path $P_2$ and
$Q_2$. To simplify matters, ignore the photon polarization and consider a quantum theory
with $U = R^3$. Let $e$ represent the initial description of the apparatus and photon, let $P_j$
= "The photon is on the path $P_j$ at a time $t$ after the first beam splitter is encountered
but before the mirror is encountered," and similarly for $Q_j$ and let proposition $D_j$ be true
if the photon reaches some chosen point at the entrance to detector $D_j$. Suppress time
subscripts for convenience. Then, assuming that $D_j$ implies both $P_1 \vee P_2$ and $Q_1 \vee Q_2$,
$(e \to D_j) = (e \to (P_1 \vee P_2) \wedge (Q_1 \vee Q_2) \wedge D_j)$ and using
$P_1 \wedge P_2 = Q_1 \wedge Q_2 = \text{false}$,

$$(e \to D_j) = \sum_{n,m=1}^{2} (e \to P_n)(e \wedge P_n \to Q_m)(e \wedge P_n \wedge Q_m \to D_j). \tag{18}$$

Figure 1. A photon interferometer as described in the text where a single photon
($\gamma$) encounters a beam splitter ($S_1$), one of two mirrors ($M_1$ or $M_2$) and a second
beam splitter ($S_2$) before entering one of two detectors ($D_1$ or $D_2$). The photon
may either reach its detector by path $P_1$ and $Q_1$ or by path $P_2$ and $Q_2$.

If $P_n$ and $Q_m$ are single points in $U$, then using II.c and assuming that $(P_n \to Q_m)$ is zero
unless $n = m$, $(e \to D_j)$ is given by the sum of the complex probabilities for the two paths

$$(e \to D_j) = \sum_{n=1}^{2} (e \to P_n)(P_n \to Q_n)(Q_n \to D_j). \tag{19}$$

Assume that $(e \to P_n) = k_1 e^{i[n=2]\pi/2}$, that $(P_n \to Q_n)$ takes the same value for both
paths and that $(Q_n \to D_j) = k_2 e^{i[n \neq j]\pi/2}$,
where $k_1$ and $k_2$ are complex constants and where $[a]$ denotes the function which is 1 if $a$
is true and 0 if $a$ is false. Thus, $(e \to D_1) = 0$ by cancellation of the two paths and since
$\mathrm{Prob}(e, D_1)$ is proportional to $|(e \to D_1)|^2$, there will be no counts observed in detector $D_1$.
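The cancellation at detector $D_1$ is easy to reproduce numerically. The sketch below is an illustration under assumptions stated here, not a definitive model: each beam splitter reflection is taken to contribute a phase $e^{i\pi/2}$, so that $(e \to P_n) = k_1 e^{i[n=2]\pi/2}$ and $(Q_n \to D_j) = k_2 e^{i[n \neq j]\pi/2}$, the arrow $(P_n \to Q_n)$ is set to 1 for both paths, and the constants `k1`, `k2` are hypothetical.

```python
import cmath
import math

def indicator(condition):
    """[a] = 1 if a is true, 0 if a is false."""
    return 1 if condition else 0

def amp_e_to_P(n, k1):
    return k1 * cmath.exp(1j * indicator(n == 2) * math.pi / 2)

def amp_P_to_Q(n):
    return 1 + 0j   # assumption: identical for both paths

def amp_Q_to_D(n, j, k2):
    return k2 * cmath.exp(1j * indicator(n != j) * math.pi / 2)

def amp_e_to_D(j, k1=0.8 + 0.1j, k2=0.5 - 0.3j):
    """Sum over the two paths of the product of the three arrows along each path."""
    return sum(amp_e_to_P(n, k1) * amp_P_to_Q(n) * amp_Q_to_D(n, j, k2) for n in (1, 2))

print(abs(amp_e_to_D(1)) ** 2, abs(amp_e_to_D(2)) ** 2)
```

Whatever values are chosen for `k1` and `k2`, the two path amplitudes into $D_1$ differ by a factor of $-1$ and cancel, while those into $D_2$ add.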
Suppose, now, that the mirror $M_1$ is attached to a spring and to a device $H$ which has two
states: "hit" if the oscillation in the spring has ever been above some threshold and "nohit"
otherwise. Equation (18) follows as before, but (19) does not since $P_n$ is no longer a point
in the enlarged state space $U' = U \times \{\text{hit}, \text{nohit}\}$. Use subscripts $e$, $P$, $Q$ and $D$
to indicate times, so that $\text{hit}_Q$ denotes "$H$ is in state 'hit' at the time when $Q$ is evaluated"
etc., and let $e' = e \wedge \text{nohit}_e$. By the definition of Prob in the new state space $U'$,
$\mathrm{Prob}(e', D_1)$ is proportional to
$|(e' \to D_1 \wedge \text{hit}_D)|^2 + |(e' \to D_1 \wedge \text{nohit}_D)|^2$. If $H$ is completely successful
at measuring whether a photon has struck $M_1$, then we can assume that
$\text{hit}_D \Rightarrow Q_1$ and $\text{nohit}_D \Rightarrow Q_2$ and
$\text{hit}_D \Rightarrow \text{hit}_Q \Rightarrow \text{nohit}_P$. Thus,

$$(e' \to D_1 \wedge \text{hit}_D) = (e' \to P_1 \wedge \text{nohit}_P \wedge Q_1 \wedge \text{hit}_Q \wedge D_1 \wedge \text{hit}_D)$$

$$= (e' \to P_1 \wedge \text{nohit}_P)(P_1 \wedge \text{nohit}_P \to Q_1 \wedge \text{hit}_Q)(Q_1 \wedge \text{hit}_Q \to D_1 \wedge \text{hit}_D) \tag{20}$$

and since there are no longer two terms to interfere, $\mathrm{Prob}(e', D_1)$ is no longer zero. At the
other extreme, if $H$ is far from sensitive enough to determine if $M_1$ has been struck by the
photon, we can assume that $\text{hit}_t$ and $\text{nohit}_t$ are independent of $P_j$, $Q_j$ and $D_j$ for all
$t$. Then $(e' \to D_1 \wedge \text{hit}_D)$ is equal to

$$\sum_{h, h' \in \{\text{hit}, \text{nohit}\}} (e' \to h_P)(h_P \to h'_Q)(h'_Q \to \text{hit}_D) \sum_{n=1}^{2} (e' \to P_n)(P_n \to Q_n)(Q_n \to D_1) \tag{21}$$

and thus $(e' \to D_1 \wedge \text{hit}_D) = (e' \to D_1 \wedge \text{nohit}_D) = 0$ and the previous prediction of
no counts in $D_1$ is restored, as expected. The same mechanism for removing irrelevant propositions
by assuming independence is also the source of the "locality" of quantum theories. For
example, if two unrelated experiments $e_1$ and $e_2$ have possible results $r_1$ and $r_2$ respectively,
then the assumptions that $\{r_1, r_2\}$, $\{e_1, r_2\}$ and $\{e_2, r_1\}$ are independent allow the expected
conclusion $(e_1 \wedge e_2 \to r_1 \wedge r_2) = (e_1 \to r_1)(e_2 \to r_2)$.
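The two extremes for the device $H$ can be compared in a few lines. This is a hedged sketch, not the paper's calculation: path amplitudes use the same assumed phase convention as the plain interferometer (a phase $e^{i\pi/2}$ per reflection, constants set to 1), a perfect device is modeled by letting each final state of $H$ receive exactly one path, and an insensitive device is modeled by a hypothetical photon-independent amplitude `h_amp[h]` multiplying the full two-path sum.

```python
import cmath
import math

def phase(flag):
    """Phase factor exp(i*pi/2) when flag is true, otherwise 1 (assumed convention)."""
    return cmath.exp(1j * math.pi / 2) if flag else 1 + 0j

def path_amplitude(n, j):
    """Amplitude for path n (1 or 2) ending in detector j, with k1 = k2 = 1 assumed."""
    return phase(n == 2) * phase(n != j)

def prob_perfect_device(j):
    # hit <-> path 1 and nohit <-> path 2: one path per final state of H,
    # so the squares are taken before adding.
    return sum(abs(path_amplitude(n, j)) ** 2 for n in (1, 2))

def prob_insensitive_device(j, h_amp):
    # H is independent of the photon: the amplitude for each final state of H is
    # a photon-independent factor times the two-path sum.
    photon = sum(path_amplitude(n, j) for n in (1, 2))
    return sum(abs(h_amp[h] * photon) ** 2 for h in ("hit", "nohit"))

h_amp = {"hit": 0.1 + 0.0j, "nohit": 0.9 + 0.2j}   # hypothetical values
print(prob_perfect_device(1), prob_insensitive_device(1, h_amp))
```

The values are unnormalized; only the contrast matters: a perfect record gives counts in $D_1$, while an insensitive device leaves the cancellation intact.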



5. Basic Properties of Quantum Theories 

In section 4, we have seen that an energy eigenfunction should not be viewed as a state
of the system, but rather as the best description of a system given certain initial knowledge.
In particular, if only $U_0$ is known, an energy eigenstate is the best description of the system
in a conservative, time invariant quantum theory. In effect, energy eigenstates arise from a
nearly complete lack of knowledge of where the system is in the state space. As might be
expected, eigenfunctions of other operators arise in a similar way. Given a quantum theory
$U$ with normal initial knowledge $e_0$ and given a Hermitian operator $Q$, let

$$q(V) = \frac{\int_{x \in V} \Psi^*(x)\, Q \Psi(x)}{\int_{x \in V} \Psi^*(x) \Psi(x)} \tag{22}$$

where $V \subset U$ and $\Psi(x) = (e_0 \to x_0)$. If $\Psi$ and $Q\Psi$ are continuous and if there is no
reason to choose $q(V) \neq q(V')$ for any $V, V' \subset U$, $\Psi$ must be chosen from among the
eigenfunctions of $Q$. If $Q\phi_j = q_j \phi_j$ defines the eigenfunctions of $Q$, let

$$b_j = \text{"The system, at time } t = 0 \text{, is best described by } \phi_j \text{"} \tag{23}$$

so that $b_j \wedge b_k = \text{false}$ if $j \neq k$ and, by assumption, $\bigvee_j b_j = \text{true}$. Therefore

$$(e_0 \to x_0) = (e_0 \to \textstyle\bigvee_j b_j \wedge x_0) = \sum_j (e_0 \to b_j)(e_0 \wedge b_j \to x_0) = \sum_j (e_0 \to b_j)\, \phi_j(x) \tag{24}$$

which provides an expansion of the initial wavefunction in eigenfunctions of $Q$. Since $Q$
is Hermitian, its eigenfunctions are orthogonal, and, by theorem 1, $\{b_1, b_2, \ldots\}$ supports
probabilities with initial knowledge $e_0$ and so

$$\mathrm{Prob}(e_0, b_j) = \frac{\int_{x \in U} |e_0 \to b_j \wedge x_0|^2}{\int_{x \in U} |e_0 \to x_0|^2} \tag{25}$$

has a frequency interpretation. To put this expansion in a more familiar form, let the
normalized wavefunction be written as

$$\hat{\Psi}(x) = \sum_j a_j \hat{\phi}_j(x)$$

where $\hat{\phi}_j$ are the normalized eigenfunctions
of $Q$ and where $\mathrm{Prob}(e_0, b_j) = |a_j|^2$. We have thus derived what is usually taken as the
expansion postulate in conventional quantum mechanics.

In conventional quantum mechanics, a sharp distinction is made between pure states,
which can be described by a single wavefunction, and statistical mixtures, which must,
in general, be described by a density matrix. Since probability theory itself is no longer
available to us, these "statistical mixtures" must be described entirely within complex
probability theory. To investigate this issue, consider several situations which require density
matrices in conventional theory. First, consider a system with initial knowledge $e_0$ which
is known to be well described by one of the wavefunctions $\psi_1, \psi_2, \ldots$ which may or may
not be orthogonal. This would normally be represented as a mixture. As before, we have
$(e_0 \to x_0) = \sum_j (e_0 \to b_j)\, \psi_j(x)$ where $b_j$ = "The system at $t = 0$ is best described by
$\psi_j$." Thus, in a quantum theory, not knowing which $\psi_j$ best describes a system is no different
from a pure superposition of $\psi_j$. To put it another way, all such expansions can be considered
as mixtures with, in general, complex probabilities as coefficients and where a "statistical"
mixture is only a special case. Density matrices are also needed in the case of "open systems"
where $S \subset U$ is "the system" and $R = U - S$ is the rest of the world. If $e_S$ summarizes
the initial knowledge of the system $S$, then with $x \in S$, $\psi(x) = (e_S \to x_0)$ is the initial
wavefunction, as usual. In quantum theory, the single function $\psi(x)$ is sufficient to describe
$S$ independent of what is known about the rest of the world if, for propositions $e_R$ about
the rest of the world, $(e_S \wedge e_R \to x_0) = (e_S \to x_0)$ which is just the condition that $e_R$ and
$x_0$ are independent. As a final example, consider a composite quantum theory $U = U_a \times U_b$
where Hermitian operator $A$ satisfies $A\phi_j = \alpha_j \phi_j$ ($\phi_j : U_a \to C$) and Hermitian operator $B$
satisfies $B\psi_k = \beta_k \psi_k$ ($\psi_k : U_b \to C$). Following reference 10, evaluate the expectation value
of $A$. For convenience, let all propositions implicitly be evaluated at time $t = 0$ and define $a_j$
= "$U_a$ is best described by $\phi_j$" and $b_k$ = "$U_b$ is best described by $\psi_k$." With initial normal
knowledge $e$, $\{a_1, a_2, \ldots\}$ supports probabilities by virtue of theorem 1 and so we can define
the expectation value $\langle A \rangle = \sum_j \alpha_j \mathrm{Prob}(e, a_j)$. Using the definition of Prob,

$$\langle A \rangle = \sum_j \alpha_j \int_{x_a \in U_a} \int_{x_b \in U_b} |e \to a_j \wedge x_a \wedge x_b|^2 / Z \tag{26}$$

where $Z = \int_{U_a} \int_{U_b} |e \to x_a \wedge x_b|^2$. Since $\bigvee_k b_k = \text{true}$ and using the orthogonality of
$(e \to a_j \wedge b_k \wedge x_a \wedge x_b)$ in both indices,

$$\langle A \rangle = \sum_{j,k,m} \int_{U_a} \int_{U_b} (e \to a_j \wedge b_k \wedge x_a \wedge x_b)^* A\, (e \to a_m \wedge b_k \wedge x_a \wedge x_b) / Z \tag{27}$$

and so

$$\langle A \rangle = \sum_k |e \to b_k|^2 \int_{U_a} \int_{U_b} (e \wedge b_k \to x_a \wedge x_b)^* A\, (e \wedge b_k \to x_a \wedge x_b) / Z. \tag{28}$$

Let the wavefunction "relative to $b_k$" be defined by
$\Psi^k_{rel}(x_a, x_b) = (e \wedge b_k \to x_a \wedge x_b) / \left( \int_{U_a} \int_{U_b} |e \wedge b_k \to x_a \wedge x_b|^2 \right)^{1/2}$.
Then

$$\langle A \rangle = \sum_k \mathrm{Prob}(e, b_k) \int \Psi^{k*}_{rel}(x)\, A\, \Psi^k_{rel}(x) \tag{29}$$

agrees with the result from reference 10 where a demonstration is also given that no single
wavefunction defined on $U_a$ gives the correct marginal distributions for all operators defined
on $U_a$.
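The equality between the direct expectation value and the average over relative wavefunctions can be checked on a small discrete example. The sketch below rests on assumptions made only for illustration: both factors of the composite system have two orthonormal eigenfunctions, so integrals collapse to sums over the index pair, and the joint amplitudes `c[j][k]` and eigenvalues `alpha` are hypothetical numbers.

```python
# Hypothetical joint amplitudes c[j][k] standing for (e -> a_j ^ b_k), illustrative values.
c = [[0.6 + 0.0j, 0.0 + 0.3j],
     [0.2 - 0.4j, 0.5 + 0.1j]]
alpha = [1.0, -1.0]    # assumed eigenvalues of A

Z = sum(abs(c[j][k]) ** 2 for j in range(2) for k in range(2))

def expectation_direct():
    """<A> as the sum over j of alpha_j times Prob(e, a_j)."""
    return sum(alpha[j] * sum(abs(c[j][k]) ** 2 for k in range(2)) for j in range(2)) / Z

def expectation_relative():
    """<A> as the sum over k of Prob(e, b_k) times <A> in the state relative to b_k."""
    total = 0.0
    for k in range(2):
        norm_k = sum(abs(c[j][k]) ** 2 for j in range(2))
        prob_bk = norm_k / Z
        relative = [c[j][k] / norm_k ** 0.5 for j in range(2)]   # normalized relative state
        total += prob_bk * sum(alpha[j] * abs(relative[j]) ** 2 for j in range(2))
    return total

print(expectation_direct(), expectation_relative())
```

Both routes give the same number even though the joint amplitudes do not factorize, which is the sense in which the relative wavefunctions reproduce a mixture.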

From these examples, we conclude that quantum theories are able to naturally describe 
mixtures without requiring extension of the axioms. Of course, in the case of the spin of an 
electron, a density matrix rather than a superposition is required to represent, for example, 
an unpolarized particle. Although we do not treat spin here, this observation indicates that 
spin cannot be described by a quantum theory with a two element state space. 

There has been special recent interest in precision tests of quantum mechanics and in
the question of whether non-linear extensions of quantum mechanics are possible. Since
linearity is assumed in conventional quantum mechanics, it is not possible to investigate this
question within the usual framework. We can, however, show that linearity is a consequence
of axioms I-III. Let proposition $e$ be the description of an experiment and let $E$ be a set of
propositions describing a set of possible initial conditions. For convenience, suppress a $t = 0$
subscript on all propositions and let $\Psi_a(x) = (e \wedge a \to x)$ for any proposition $a$. Introduce
$a$ = "The system is best described by $\Psi_a$" and $b$ = "The system is best described by $\Psi_b$"
and let $z(e, E, a, b) = \{((e \wedge e' \to a), (e \wedge e' \to b)) : e' \in E\}$. Linearity of quantum
theories is then guaranteed by the following.

Theorem 3 (Linearity). If $e$ is a proposition and $E$ is a set of propositions and $U$ is a quantum
theory, then, for any pair $(\alpha, \beta) \in z(e, E, a, b)$ such that $\alpha + \beta \neq 0$, there exists a
proposition $c$ such that $(\alpha + \beta) \Psi_c(x) = \alpha \Psi_a(x) + \beta \Psi_b(x)$.

Proof. Let $(\alpha, \beta) \in z(e, E, a, b)$ and let $e'$ satisfy $(e \wedge e' \to a) = \alpha$ and
$(e \wedge e' \to b) = \beta$. The case where $a = b$ is trivial. Otherwise, let $w = e \wedge e' \wedge (a \vee b)$.
We have $(w \to x) = (w \to (a \vee b) \wedge x)$ [since $(e \wedge e' \to a \vee b) = \alpha + \beta \neq 0$] which
equals $(w \to a \wedge x) + (w \to b \wedge x)$ which equals $(w \to a) \Psi_a(x) + (w \to b) \Psi_b(x)$. Since
$(w \to a) = \alpha / (\alpha + \beta)$ and $(w \to b) = \beta / (\alpha + \beta)$, the theorem is proven if we let
$c = e' \wedge (a \vee b)$.

Since the proof of theorem 3 only uses the complex probability axioms, the linearity of
quantum theories is a direct result of the addition of probabilities in I.b and so a non-linear
extension of quantum mechanics would not be consistent with complex probability theory
or with Cox's axioms.
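The algebra behind theorem 3 can be checked on a toy model. The sketch below is an illustrative assumption, not the paper's construction: propositions are sets of hypothetical "worlds" with complex weights, the arrow is a ratio of summed weights, the background knowledge is trivial, and $a$ and $b$ are mutually exclusive. It verifies that $(\alpha + \beta)\Psi_c = \alpha\Psi_a + \beta\Psi_b$ with $c = a \vee b$.

```python
# Hypothetical toy model: seven worlds with complex weights (illustrative values).
WEIGHTS = [0.5 + 0.1j, 0.2 - 0.3j, -0.1 + 0.4j, 0.3 + 0.3j,
           0.1 - 0.2j, 0.4 + 0.0j, 0.25 + 0.5j]
ALL = frozenset(range(len(WEIGHTS)))

def weight(prop):
    return sum(WEIGHTS[w] for w in prop)

def arrow(a, b):
    """Toy (a -> b): ratio of complex weights; requires weight(a) != 0."""
    return weight(a & b) / weight(a)

a = frozenset({0, 1, 2})    # "the system is best described by Psi_a"
b = frozenset({3, 4, 5})    # "the system is best described by Psi_b"
e = ALL                     # trivial background knowledge in this toy
c = a | b

alpha, beta = arrow(e, a), arrow(e, b)

def psi(knowledge, world):
    """Psi_knowledge(x) = (e ^ knowledge -> x) for a single world x."""
    return arrow(e & knowledge, frozenset({world}))

def linearity_gap():
    return max(abs((alpha + beta) * psi(c, x) - (alpha * psi(a, x) + beta * psi(b, x)))
               for x in sorted(ALL))

print(alpha + beta, linearity_gap())
```

The gap is zero up to rounding even though $\alpha + \beta \neq 1$ here, because world 6 lies outside both $a$ and $b$.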

6. The Bayesian View of Complex Probabilities 

One of the benefits of realistic quantum theories based on complex probability is a simple
explanation of the puzzling, not quite paradoxical problems in quantum mechanics involving
the collapse of the wavefunction, non-local effects and the role of the observer in the theory.
Within realistic quantum theories, all of these problems are ultimately due to mistaking the
wavefunction for the state of the system as discussed in section 4. Here we illustrate how
the idea that a wavefunction represents knowledge about a system and not its state arises
in the Bayesian view of complex probabilities.

The puzzling phenomenon of wavefunction collapse can be illustrated by imagining a 
wave packet which strikes a barrier causing part of the wave packet to be reflected to the 
left and part to be transmitted to the right. After some time, a box on the right is sealed 
and a device tests (with 100% efficiency) whether there is a particle in the box. If the test 
is successful, a repeated measurement must also be successful with probability 1, and so the 
wave packet on the left must, instantaneously and mysteriously, travel to the box on the right, 
penetrate its walls and join the right half of the wave packet. However, from the point of 
view of a realistic quantum theory, after striking the barrier, the particle did either go to the 
left or the right and the diverging wave packets only represent the fact that we do not know 
which way it actually went. The collapse of the wavefunction just corresponds to learning 
a new fact about the system and using that fact in calculating new complex probabilities. 
Since this is only a change in the description of the system, it does not correspond to 
anything actually happening to the particle, the box or to the observer. Similarly, in the 
EPR experiment as described above in the discussion of Bell's theorem, the measurement 
of one of the spins has no effect on the remote particle and so, for instance, it should not
be possible to use EPR correlations for non-local communication. This difference between
viewing the wavefunction as a representation of what one knows about a system rather than
its state is directly analogous to the difference between the classical view of a probability as
something determined by an underlying random phenomenon and the Bayesian view where
a probability represents what one happens to know about a system.

The same point can be illustrated once again by Schrödinger's cat, who is put in an opaque
box with a device which will kill it with probability $\frac{1}{2}$. In the mysterious version, after the
device has acted, it would not be correct to say that the cat is either alive or dead until an
observer opens the box and looks, causing the wavefunction of the cat to collapse into one of
the two alternatives. However, consider, for comparison, the classical problem of a six sided
die in an opaque box. If the upward face of the die is unknown, a Bayesian would describe
this situation with a "superposition" $p_j = 1/6$, $j = 1, \ldots, 6$ (determined, for instance, by
maximizing entropy) and there would thus be a "collapse of the probability distribution"
when the box is opened and the die face is revealed. However, the "superposition" $p_j$ and
collapse is not mysterious at all precisely because $p_j$ is not mistaken for the "state of the
die." Similarly, the mystery of Schrödinger's cat disappears if one takes the Bayesian view
of complex probabilities.
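The classical half of this comparison is simple enough to write out. The sketch below is only an illustration of ordinary Bayesian conditioning for the die in the box; the helper name `condition` and the particular facts learned (an "even face" hint, then face 4) are hypothetical choices for this example.

```python
from fractions import Fraction

def condition(prior, is_consistent):
    """Bayesian update: keep faces consistent with the new fact and renormalize."""
    kept = {face: p for face, p in prior.items() if is_consistent(face)}
    total = sum(kept.values())
    return {face: kept.get(face, Fraction(0)) / total for face in prior}

prior = {face: Fraction(1, 6) for face in range(1, 7)}       # nothing known: p_j = 1/6
after_hint = condition(prior, lambda face: face % 2 == 0)    # told "the face is even"
after_look = condition(after_hint, lambda face: face == 4)   # box opened: the face is 4

print(after_hint)
print(after_look)
```

Each "collapse" is just a change in what is known; nothing happens to the die when the distribution is updated.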

Since, as we have seen, wavefunctions only represent what is known about a system
rather than its state, quantum theory should also be useful for systematically improving
wavefunctions based on prior information in the same sense that Bayesian inference is
used to improve probability distributions. Analogues of both components of Bayesian
inference are available in quantum theories since Bayes theorem follows [if $(a \to b) \neq 0$,
then $(a \wedge b \to c) = (a \to c)(a \wedge c \to b) / (a \to b)$] and since a suitable maximum entropy
principle has already been proposed and successfully used in many-body problems.
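For completeness, the complex form of Bayes theorem quoted above follows in two lines from the product rule I.a applied in both orders, assuming only that $b \wedge c$ and $c \wedge b$ are the same proposition.

```latex
\begin{align*}
(a \to b \wedge c) &= (a \to b)(a \wedge b \to c), \\
(a \to c \wedge b) &= (a \to c)(a \wedge c \to b), \\
\Rightarrow\quad (a \wedge b \to c) &= \frac{(a \to c)(a \wedge c \to b)}{(a \to b)}
\qquad \text{if } (a \to b) \neq 0 .
\end{align*}
```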

7. Summary 

Starting with complex probability theory and a simple statement of realism, we are able to
derive much of what is assumed in conventional quantum mechanics including the probability
interpretation, the superposition principle, the expansion postulate and the wave equation
including static fields and a metric. These quantum theories have a convenient treatment
of mixed systems which are represented without extension to the axioms. The Bayesian
view of complex probabilities provides an easy understanding of the EPR experiment, Bell's
theorem and the other mysterious problems in conventional quantum mechanics and suggests
a program for improving wave functions analogous to Bayesian inference in probability
theory. It remains to be seen whether particles with spin, multi-particle systems, bosons and
fermions and field theories can also be based on complex probability theory. Open questions
about the fundamentals of quantum theories also remain. In particular, the sense in which
probability theory is restored in the classical limit needs to be quantified. It might also be
possible to show that the state axioms are a consequence of the complex probability axioms
in the sense that they are required for a frequency interpretation to exist.